Fast and Accurate Maximum Inner Product Recommendations on Map-Reduce
Authors
Abstract
Personalization has become a predominant theme in online advertising; the internet allows advertisers to target only those users with the greatest chances of engagement, maximizing the probability of success and user happiness. However, the cost of a naïve approach to matching users with their most suitable content scales proportionally to the product of the cardinalities of the user and content sets. For advertisers with large portfolios, this quickly becomes intractable. In this work, we address this more general top-k personalization problem, giving a scalable method to produce recommendations based on personalization models where the affinity between a user and an item is captured by an inner product (i.e., most matrix factorization models). We first transform the problem into finding the k-nearest neighbors among the items for each user, then approximate the solution via a method that is particularly suited for use on a map-reduce cluster. We empirically show that our method is between 1 and 2 orders of magnitude faster than previous work, while maintaining excellent approximation quality. Additionally, we provide an open-source implementation of our proposed method; this implementation is used in production at Etsy for a number of large-scale personalization systems, and is the same code as used in the experiments below.
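The abstract does not spell out how the inner-product problem is turned into a nearest-neighbor problem. As an illustration only, the sketch below uses one well-known reduction, which may differ from the authors' transformation: each item vector gets an extra coordinate so that all items share the same norm, and the user vector gets a trailing zero. Under that assumption, ranking items by Euclidean distance to the user matches ranking them by inner product. All function names here are hypothetical, and the exhaustive search stands in for the approximate, map-reduce-friendly search the abstract describes.

```python
import math


def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def augment_items(items):
    """Append sqrt(M^2 - ||x||^2) to each item, where M is the largest item norm.

    After this step every augmented item has squared norm M^2.
    """
    max_sq = max(dot(x, x) for x in items)
    return [list(x) + [math.sqrt(max(0.0, max_sq - dot(x, x)))] for x in items]


def augment_user(user):
    """Append a zero so the user lives in the same augmented space."""
    return list(user) + [0.0]


def top_k_by_inner_product(user, items, k):
    """Exact top-k by brute force: one inner product per item."""
    order = sorted(range(len(items)), key=lambda i: -dot(user, items[i]))
    return order[:k]


def top_k_by_distance(user, items, k):
    """Top-k via nearest neighbors in the augmented space.

    ||q - x'||^2 = ||q||^2 + M^2 - 2 * <user, x>, so the smallest distance
    corresponds to the largest inner product.
    """
    q = augment_user(user)
    augmented = augment_items(items)

    def sq_dist(p):
        return sum((a - b) ** 2 for a, b in zip(q, p))

    order = sorted(range(len(augmented)), key=lambda i: sq_dist(augmented[i]))
    return order[:k]


if __name__ == "__main__":
    # Toy data, invented for illustration.
    items = [[1.0, 0.0], [0.0, 2.0], [3.0, 3.0], [-1.0, 0.5]]
    user = [0.5, 1.0]
    print(top_k_by_inner_product(user, items, 2))
    print(top_k_by_distance(user, items, 2))
```

Both functions return the same item indices on this toy data. The brute-force version makes the scaling problem concrete: scoring every user against every item costs one inner product per (user, item) pair, which is the product of the two set sizes mentioned in the abstract.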
Similar resources
Asymmetric LSH (ALSH) for Sublinear Time Maximum Inner Product Search (MIPS)
We present the first provably sublinear time hashing algorithm for approximate Maximum Inner Product Search (MIPS). Searching with (un-normalized) inner product as the underlying similarity measure is a known difficult problem and finding hashing schemes for MIPS was considered hard. While the existing Locality Sensitive Hashing (LSH) framework is insufficient for solving MIPS, in this paper we...
A Comparative Study of Fuzzy Inner Product Spaces
In the present paper, we investigate a connection between two fuzzy inner products, one of which arises from Felbin's fuzzy norm and the other of which is based on Bag and Samanta's fuzzy norm. We also show how, given a fuzzy inner product space, one can construct another kind of fuzzy inner product on this space.
NORM AND INNER PRODUCT ON FUZZY LINEAR SPACES OVER FUZZY FIELDS
In this paper, we introduce the concepts of norm and inner product on fuzzy linear spaces over fuzzy fields and discuss some fundamental properties.
Orthogonality preserving mappings on inner product C*-modules
Suppose that A is a C^*-algebra. We consider the class of A-linear mappings between two inner product A-modules such that, for each pair of orthogonal vectors in the domain space, their values are orthogonal in the target space. In this paper, we intend to determine A-linear mappings that preserve orthogonality. For this purpose, suppose that E and F are two inner product A-modules and A+ is the set o...
Maximum A Posteriori Inference in Sum-Product Networks
Sum-product networks (SPNs) are a class of probabilistic graphical models that allow tractable marginal inference. However, the maximum a posteriori (MAP) inference in SPNs is NP-hard. We investigate MAP inference in SPNs from both theoretical and algorithmic perspectives. For the theoretical part, we reduce general MAP inference to its special case without evidence and hidden variables; we als...